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I— I Abstract: The signal for a highly boosted heavy resonance competing against a back- 

pg ground of light parton jets at the LHC can be enhanced by analyzing subjets in the "fat" 

^ jet that possibly contains the heavy resonance. Three methods for doing this are known 

as filtering, pruning, and trimming. We study the possibility of combining these methods 



■^ using a relative likelihood approach. We find that, because the methods are not the same, 
one achieves an enhanced statistical power by combining them. We illustrate the possibil- 

f^ ities first with a simple problem of combining trimming and pruning to enhance the signal 

O for finding a boosted top quark. We then study the more difficult problem of disentangling 

. . from the background the signal for the production of a Higgs boson in association with a 

. !^ Z-boson. For this problem, we combine filtering, trimming, and pruning. 

X 
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1. Introduction 

A central problem for data analysis at the Large Hadron Collider (LHC) is to find the 
signal for the production of a new heavy particle against a background of jets produced by 
standard model processes that do not involve the heavy particle. An important example is 
the production of a Higgs boson in association with a Z-boson, where the Z-boson decays 
into e^e~ or /U"'^/x~ and the Higgs boson decays into hh. One can demand that the Z-boson 
has large transverse momentum, say Pt > 200 GeV. Since it recoils against the Z-boson, 
the Higgs boson then has a large transverse momentum and is easier to find than if it had 
low transverse momentum [1]. Nevertheless, there is a large background to this process 
from standard model processes that do not involve the Higgs boson, so some ingenuity is 
required to separate the signal from the background. 

Three methods have been proposed for this sort of problem: filtering [1], trimming 
[2], and pruning [3, 4]. These methods are not the same and, because they are not the 
same, there is more information available in combinations of two or three of them than is 
available in any one method. We will illustrate this in this paper. We will find that, by 
combining methods, we can enhance the significance of the signal. 

There are other cases of searches for as yet undiscovered physics in which the easiest 
sort of data analysis would involve simply looking for a bump in the mass distribution 
of jets found with the kx jet algorithm or a similar standard algorithm. In such cases, 
filtering, trimming, or pruning the jet can help, but may still produce a barely sufficient 
statistical significance for finding the signal with the integrated luminosity that is available. 
In such cases, combining methods as described in this paper can further enhance the signal. 

We combine methods using a likelihood analysis. We will also see that methods can 
be combined using a cut-based analysis, but the likelihood method is more powerful. 

The analysis of this paper applies in general to processes in which the signal for a highly 
boosted (i.e. high transverse momentum) heavy resonance competes against a background 



of light parton {g, u, u, d, d, s, s, c, c, b, b) jets. We begin in section 2 with a simple example 
in which we combine trimming and pruning to enhance the signal for a highly boosted top 
quark that decays into hadrons. This example gives us a chance to outline briefly what 
trimming and pruning are and to then illustrate how they can be combined. We then 
turn, in section 3, to the more challenging ZH production process. Here we use filtering, 
trimming, and pruning. Some conclusions follow in section 4. In appendix A we review 
some basics of the likelihood analysis that we use. 



2. Top quark identification 

This paper concerns detecting Higgs boson production in association with a Z-boson. Our 
analysis of ZH production involves three methods of subjet analysis, known as trimming, 
pruning, and filtering, the last with tagging of subjets containing a 6-quark. In addition, 
we use the likelihood ratio both as a measure of statistical significance and as a tool for 
combining different methods for detecting the same signal. In order not to introduce too 
many ideas at once in a somewhat complex analysis, we choose to introduce in this section 
some of the needed concepts: trimming, pruning, and our use of the likelihood ratio. 
We introduce these tools with the aid of a simple but quite artificial analysis involving 
the identification of a top-quark jet. We note that the reconstruction of boosted top 
jets has been considered in many different subjet analysis before [3, 4, 5, 6, 7, 8, 9, 10]. 
We emphasize that the "signal" that we study is not realistic, nor do we apply sensible 
cuts to define the signal and background event samples. Rather, we analyze "signal" and 
"background" event samples only to introduce some of the conceptual ingredients that we 
need. Once we have seen these ingredients, we can turn to ZH production in section 3. 

For this study, we use AlpGen [11] to generate "signal" and background events and 
we shower the events using Pythia 6.3 [12] and recombine the jets using Fast Jet [13]. We 
have in mind a scenario where a resonance in the s-channel {e.g. from a strongly coupled 
sector [14]) splits into tt. To mimic the signal of this scenario, we generate events with tt 
pairs in which the t and the t have large transverse momenta. We use the standard model 
process gg — )• ti to generate the top quarks, but with half the standard model differential 
cross section and with a cut Pt > 350 GeV for the hardest of the t or i. We select events 
in which one of the top quarks decays to biv, i = e ov fi, with Px/ > 15 GeV. The other 
top quark decays hadronically, to bqq. The total signal cross-section that we generate with 
these cuts is 1.5 pb. We generate the background events using standard model production 
of a M^-boson (with W — )• iv) recoiling against light parton jets. Events are selected for 
the analysis if the largest Pt jet (with the inclusive Cambridge- Aachen algorithm [15] with 
R = 1.5) has Pt > 100 GeV. Here and throughout this paper we take -^/s = 14 TeV. 

Having generated events, we now set the problem: to find the top quark that decays 
hadronically against the background of light parton jets that recoil against a VF-boson.^ To 



^If we really wanted to do a good job of finding the signal that we have chosen, we would impose a stiff 
Pt cut on all jets chosen for analysis. For our pedagogical purposes, we choose not to do this. 



start, we find the largest Pt jet in the event. ^ This jet should contain the decay products 
of the top quark if there is a top quark. To make sure that the decay products are well 
contained, we should use a jet finding algorithm that uses a fairly wide angular range. 
There is some choice here. We use either the inclusive Cambridge- Aachen algorithm [15] 
with a cone size R = 1.5 or the anti-Zcr algorithm [16] with R = 1.5. Because of the large 
angular size used in the jet finding algorithm, we call this the fat jet. 

The simplest way to proceed from here would be to measure the invariant mass Mj^t of 
the fat jet, expecting to find Mjet ~ Mtop = 174 GeV. However, jets from the background 
sample with this angular size can have large masses. Thus we expect that the distribution of 
Mjet for background events will be substantial around the region of interest, Mjet ~ Mtop. 
Furthermore, we cannot expect the signal events to yield a narrow peak near Mj^t = Mtop 
because the fat jet will inevitably contain hadrons from partons that originate in initial state 
radiation and from secondary interactions in the underlying event. These extra hadrons 
add to Mjet and thus smear the signal distribution."^ For these reasons, we need to break 
the fat jet into subjets and analyze the structure of the subjets. 

Consider first the trimming method [2]. Here, following Ref. [2], we define the fat jet 
using the anti-Zcj' algorithm with R = 1.5. The fat jet is made of constituents that we can 
take to be individual hadrons or else very narrow jets made from calorimeter towers. Let 
us call them the starting protojets. We now apply a sequential clustering algorithm to the 
protojets, grouping them into successively fatter protojets. There is a choice of algorithm 
to use. We use the kx algorithm [17] with protojet recombination defined by adding the 
four-momenta of the protojets. This algorithm has an effective cone size R and here we 
choose a quite small cone, R = 0.2. After the kx algorithm has combined the starting 
protojets up to a kx limit defined by this R, we have a list of jets, each consisting of some 
subset of the original starting protojets. There may be, say, ten final jets. We are ready 
to trim our list of jets, keeping relatively hard jets and throwing away relatively soft jets. 
We keep jet j if 

PT,j>fxA , (2.1) 

were the hard scale A is the Pt of the fat jet. The fraction / is an adjustable parameter 
that we take to be / = 0.03. The starting protojets i contained in the jets j for which the 
inequality (2.1) holds constitute the trimmed jet. Now we measure the invariant mass of 
the trimmed jet, 

r2 




Mit=\}v^ . (2.2) 



For background events, trimming reduces Mj^t for each event and thus reduces the high 
Mjet part of the jet-mass distribution. For signal events, trimming removes extraneous 
parts of the jets, giving a sharper peak near Mjet = Mtop- The result is illustrated in 



^Jets are considered only if the absolute value of their rapidity y is less than 5. This is a very non- 
restrictive cut. However the highest Pt jet is quite likely to have |y| much less than 5. 

^Indeed, there is not even a clear distinction between partons radiated from the initial state and from 
the top quark and its daughters because the quantum amplitudes that represent these two sources can 
interfere. 



figure 1. The tt signal is clearly visible. We will investigate the statistical significance of 
the signal shortly. 
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Figure 1: Mass distribution of trimmed jets for the tt signal and the W + jets background. The 
top mass is taken to be 174 GeV. 

Now consider the pruning method [3, 4]. Here, following Refs. [3, 4], we define the 
fat jet using the inclusive Cambridge- Aachen algorithm with R = 1.5. We again take the 
fat jet to be composed of very narrow starting protojets and apply a sequential clustering 
algorithm to the protojets. This time, we choose a modified version of the Cambridge- 
Aachen algorithm [15].^ In this algorithm, in each step we look for the pair of protojets 
{i,j} for which 



R: 



1,3 



iVi - Vj) + 



- ^jV (2.3) 

is the smallest. (Here yi is the rapidity of protojet i and (pi is its azimuthal angle.) This 
pair of protojets is combined by adding their four- momenta, creating a new protojet. 
The normal Cambridge- Aachen algorithm continues until no pair {i,j} of protojets has 
Rij < Dcut, where -Dcut is a parameter that represents an effective cone size for this 
algorithm. We take -Dcut = M(fat jet)/Pr(fat jet) and let the algorithm run until it stops. 
At this stage, each pair {i,j} of protojets has Rij > Dcut- Now we let protojet combination 
continue, but with an additional restriction: for each pair {i,j} of protojets that are ready 
to be combined, we look at the momentum fraction 



m.m{pT,i,PT,: 



(2.4) 



\PT,i + PT,j I 

If z is small, there is a danger that we are including a protojet that is extraneous to the 
signal. Therefore, if 

z < Zcut , (2.5) 



^Other successive combination jet algorithms are allowed for the pruning method. With the use of the 
Cambridge- Aachen algorithm, the description of how the method works is somewhat simplified. 



we do not combine protojets i and j and instead simply drop whichever of the two protojets 
had the smaher transverse momentum. For this tt analysis, we take Zcut = 0.1. Then we 
continue with the algorithm until all protojets have either been combined or else eliminated. 
The starting protojets contained in the final jet when the algorithm stops constitute the 
pruned jet. Now we measure the invariant mass of the pruned jet, 
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(2.6) 



The motivation for pruning is essentially the same as for trimming. For background events, 
pruning reduces Mj^t for each event and thus reduces the high Mjet part of the jet-mass 
distribution. For signal events, pruning removes extraneous parts of the jets, giving a 
sharper peak near Mjet = Mtop. The result is illustrated in figure 2. The tt signal is 
visible, perhaps less so than with trimming. 
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Figure 2: Mass distribution of pruned jets for the tt signal and the T4^ + jets background. The top 
mass is taken to be 174 GeV. 



In figures 1 and 2, the ti signal is obvious. However, these figures represent theoretical 
cross sections. Data will look different. The simplest way to look for the signal in data 
is to define a mass window and count the events in the window. Let us take our mass 
window to be 160 GeV < Mj^t < 200 GeV. Let n be the number of events in this window 
after a certain amount of integrated luminosity has been accumulated. We will take the 
integrated luminosity to be JdL = 30 pb~^. Let b be the expectation value of the number of 
background events with this luminosity and let s be the expected number of signal events. 
That is, b and s are the theoretical cross sections times JdL. 

To assess what we learn from the measurement in the face of counting statistics, we 
consider that there are two competing interpretations of the data: that it all arises from 
the W + jets background (theory B) or that it arises from this background plus the tt 



signal (theory SB). The probabiHty that n events are measured if there is only background 
is b^e~^/n\. The probability that n events are measured if there is a signal plus the 



background is (6 + s)"e 
exp{C) where 



^(s+b) / \ 



/n\. The ratio of these, known as the likelihood ratio, is 



C{n) = nlog ( 1 + 



(2.7) 



If £ is substantially greater than 1 , the result strongly favors the interpretation that the tt 
signal is present. For instance £ = 4 favors the presence of the signal by a ratio exp(£) w 55. 
We review some properties of the likelihood ratio in appendix A. 
The expectation value of C{n) if the SB theory is right is 



(£(n))sB = (s + 6) log 1 + 



(2i 



Thus, we can expect to reliably see the tt signal if (£(ri))sB is substantially greater than 1. 
As a minimum requirement, we may ask for (£(7t,))sb > 4. The results are shown in table 
1. We see that trimming does better than pruning, but neither method provides enough 
statistical power to achieve (£(n))sB > 4 with an integrated luminosity of just 30 pb~^. 
(Of course, the statistical insufficiency goes away with more luminosity, but in this simple 
example we imagine that 30 pb^ is all the luminosity that we have.) 
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Table 1: Statistical significance of trimming and pruning results for an integrated luminosity of 
30 pb~ . Here we simply count the expected number of signal events, s, and background events, b, 
in a top quark mass window 160 GeV < Mjet < 200 GeV. The logarithm of the likelihood ratio 
based on these expected counts is (£(n))sB, eq. (2.8). 

It is rather artificial to base the SB vs. B assessment on simply the counts in a single 
jet mass window. The experiment will give counts nj in each bin J shown in figures 1 
and 2. We can base our assessment on the log likelihood ratio using all of the information. 
Then the likelihood ratio is the product of the likelihood ratios for all of the bins used. Its 
logarithm is 



AW) = E 



nj log 1 + 



sj 



sj 



(2.9) 



Here nj is the number of events in bin J and sj and bj are the corresponding signal and 
background cross sections times the integrated luminosity. 

The expectation value of C{{n}) if the SB theory is right is 
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{sj + bj)log{l + 
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(2.10) 



Using the full bin-by-bin information, we find 



(^({^}))SB 



4.4; 

2.4, 



trimming , 
pruning . 



(2.11) 



We see that using the full available information improves the discriminating power of the 
experiment. In fact, now the log likelihood ratio with trimming is above our nominal 
threshold of 4.0. 

Although trimming and pruning are rather similar in spirit, they are different. This 

is illustrated in figure 3. In the left-hand plot, we evaluate the trimmed jet mass, MjJ^, 

(p) ~ 

and the pruned jet mass, Mj^^^, for each simulated tt signal event. We accumulate events 

in bins of {MjJ^,Mj^l} and plot the resulting density of events. We see from the plot that 



the bins with the most events do not lie along the diagonal, M 



it) 



Jet 



m]^], in this plot. In 

fact, the bins with the most events have MjJ^ < Mj^^. In the right hand plot, we do the 
same thing for the background events. Again, the most populated bins do not lie along the 
diagonal (or along any one-dimensional curve). 
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Figure 3: Trimmed jet mass and pruned jet mass in the tt production process and the background 
process W + jets. The left plot shows the signal for Aftop — 174 GeV; the right plot shows the 
background. Generally, trimmed jet masses are smaller than pruned jet masses, but there is no 
fixed relation between them. 

Because trimmed jet masses and pruned jet masses contain different information, it 
may be possible to obtain stronger results by using both of them. The simplest way to do 
this is to measure the number of events for which both masses fall into the top quark mass 
window 160 GeV < Mjet < 200 GeV. In this case, we obtain the results summarized in 
table 2. We see that there is some improvement in the statistical significance compared to 
the results in table 1. Additionally, s/b is larger when trimming and pruning are combined. 
This is important if the normalization of b is not precisely known. 

Evidently, we could also try to improve the statistical significance by adjusting the 
mass windows used for the trimmed and pruned jet masses. We do not, however, pursue 
this avenue. 





Trimming + pruning 


Signal cross section [fb] 


360 


Background cross section [fb] 


508 


s/h 


0.71 


s/Vb {JdL = 30 pb"^) 


2.8 


(£(n))sB (JdL = 30 pb-i) 


3.1 



Table 2: Statistical significance of combined trimming and pruning results for an integrated lumi- 
nosity of 30 pb~ . Here we simply count events in which both trimmed jet mass and the pruned 
jet mass fall into the top quark mass window 160 GeV < Mjgt < 200 GeV. The notation is the 
same as in table 1. 



Instead of combining the trimming and pruning information based on the event count 
in a single window, we use the log- likelihood C, eq. (2.9), based on all of the bins in figure 
3 that contain a background cross section of at least 0.5 fb.^ We find 



(^(M)) 
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6.2, 



trimming -|- pruning 



(2.12) 



This is a significant improvement on the log likelihood ratio that we obtained with either 

trimming or pruning alone, eq. (2.11). 

We can extend the analysis so as to display more information. The number of signal 

events in each bin is a function sj{m) of the top quark mass m that we use to calculate 

the ti signal cross section. Until now, we have taken m to be Mtop = 174 GeV. However, 

we can let m vary. We consider the choices m = (145, 155, 165, 174, 185, 195, 205, 215) GeV. 

For each choice, we construct C{{n}^m) according to eq. (2.9). Then, if we were to use 

data for the number of events nj in each bin, we would test not only whether the SB theory 

is favored over just the B theory, but also which values of m are favored or disfavored by 

the data. To display what can be expected on average, we show in figure 4 the expectation 

value of £({n}, TTi) in the SB theory with the true top quark mass, Mtop = 174 GeV. That 

is, 

sj{my 



(£(M,m)) 
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(sj(Mtop) + 6j)log 1 + 



hj 



sj{m) 



(2.13) 



The results are plotted in figure 4 as a function of m. We show the results for trimming 
alone, pruning alone, and for trimming and pruning combined. We see that the SB theory 
with in = Mtop is highly favored, with a stronger result obtained if we combine trimming 
and pruning. We also see that the result using trimming and pruning combined is quite 
sensitive to the value of m: m = M^p is favored, while m = 165 GeV and m = 185 GeV are 
not favored. For these wrong values of m, (vC({n},m))sB is close to 0. For m = 155 GeV 
and m = 145 GeV, the signal -|- background theory with the wrong m is even weakly 
disfavored compared to the background only theory. 

One should not think that figure 4 is what data will look like. We plot the expectation 
value of £({n},?7i), but the values of the counts nj are subject to fluctuations. From 
appendix A, the variance of C{{n}^ m) is 

^The results are not sensitive to this cut, whicli we impose so that we can have a rehable calculation of 
sj/bj. 
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Figure 4: The log likelihood ratio in tt production as a function of the trial top quark mass to, 
assuming an integrated luminosity of 30 pb~ . We construct £({n},TO,) for eight different values 
of TO,. Then we take the expectation value of these quantities in the signal + background theory 
with the true top mass, Mtop. The results are shown for trimming alone, pruning alone, and for 
trimming and pruning combined. 
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Using the log likelihood results for trimming and pruning combined from figure 4, we plot 
£ lb [((£ — (£)sb)^)ot^]^' ^ as an error band in figure 5. Then we display five sample curves 
for C({n}) in which the counts nj in the bins J are drawn from Poisson distributions 
with mean sj{Mtop) + bj. We see that the SB theory with the right mass is generally 
favored, but that it can be more or less favored depending on whether the counts in the 
most important bins fluctuate up or down. 



3. ZH production 

We now turn to the production of a Higgs boson in association with a Z-boson, where the 
Z-boson decays into e"'"e~ or ^^ ^~ and the Higgs boson decays into bb. This process was 
considered in ref. [1] and found to contribute to the overall signal significance for a Higgs 
boson search when Mh < 130 GeV. The idea is to demand that the Z-boson have large 
transverse momentum. Then the recoiling Higgs boson has large transverse momentum 
and is easier to find against the backgrounds even though the cross section for this process 
is small. The backgrounds that we consider are ZZ production and, most importantly, 
Z + jets production. In the part of our analysis that uses the methods of ref. [1], we find 
good agreement with the results of ref. [1]. Our purpose is to extend the analysis of ref. [1] 
by investigating the improvement in background rejection obtained by using more than 
one algorithm for the analysis of subjets. 
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Figure 5: Sample results for the log likelihood ratio in the tt production process as a function 
of the trial top quark mass m, using trimming and pruning combined and assuming an integrated 
luminosity of 30 pb^ . We construct C({n},m) for eight different values of m. Then we com- 
pute £({n}, m) for five random sets of the counts nj drawn from Poisson distributions with mean 
sj(Mtop) + bj. We also show an error band based on the mean value of C{{n}, m) and its variance. 
Most points are within the error band, but note that 2 a or larger deviations either upward or 
downward will sometimes occur. 

The HW production process is also important for a Higgs search, for the same reason 
as for ZH. However, this signal has an additional important background, ti production. 
In this paper, we restrict the analysis to the simpler ZH case. 

We generate the ZH^ ZZ and Z + jets samples using Pythia 8. We include an event 
in our sample if it has an electron or muon pair with 



80 GeV < mil < 100 GeV , 
PTAi > 200 GeV . 



(3.1) 
(3.2) 



The leptons are required to have rapidity |ry| < 2.5. We further require that there be no 
additional leptons with |ry| < 2.5 and px > 30 GeV. We examine events for jets using the 
inclusive Cambridge- Aachen jet algorithm with R = 1.2 and accept an event only if it has 
a jet with Pt > 200 GeV and \r]\ < 2.5. This is the same as the event selection in ref. [1]. 

Having generated events, we now analyze them to look for the ZH signal. We will use 
the trimming and pruning analyses described in the previous section and, in addition, we 
will use the filtering method. Thus we need to describe the filtering method [1], which has 
been applied several times in association with Higgs searches [10, 18]. 

To use filtering, we first look for jets in the event using the Cambridge-Aachen (C-A) 
algorithm with R = 1.2 and select the highest Pt jet, the "fat jet." Then we examine the 
fat jet for a mass drop. If we have a signal event, then one of the splittings in the C-A 
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splitting history is likely to be the H ^ bb splitting. To look for it, we start at the trunk 
of the splitting tree and look at the first splitting, J{ij} -^ Ji + Jj. If the jet mass change 
in this splitting is large enough, 

max{Mi,Mj) < fJ-M^ijy (3.3) 

with n = 0.67, and if the transverse momentum in the splitting is large enough, 
min(p2 ,P2,) 



Mf. ., 



[iVi - Vjf + (0i - ^jf] > ycut (3.4) 



with ycut = 0.09, then we say that the mass drop condition is met and proceed to the next 
stage of the analysis. If the mass drop condition is not met, we eliminate the daughter jet 
with the smaller P^ and examine the splitting of the daughter jet with the larger P^ to 
see if its splitting satisfies the mass drop condition. This process continues until the mass 
drop condition is met. 

It can be that the mass drop condition is never met. In this case the event is removed 
from the sample. This has the possibility of preferentially removing background events. 

If the mass drop condition is met, we apply a different analysis, called filtering, to the 
daughter protojets i and j. First, we are hoping that this was a H ^^ bb splitting, so we 
ask whether both protojets i and j are tagged as containing a b or b quark. For each of 
the two protojets, we assume a ^-tagging efficiency of 60% and a mistagging probability of 
2%. If one or both protojets i and j are not tagged as 6-jets, the event is removed from 
the analysis. If both protojets i and j are tagged as 6-jets, we apply the C-A algorithm 
separately to the constituents of both of these protojets. This time, we use a smaller cone 
size 

R = min Q [ivi - y,f + {(t>i - </>,)'] '^' , O.s) . (3.5) 

This procedure simply combines the branches of the C-A splitting trees for protojets i and 
j down to the level specified by this R. 

We arrive at a list of constituent subjets of jets i and j. We hope that the two highest 
Pt subjets thus found each contain one of the previously found 6 or 6 quarks, so we ask if 
they do. If we have a double 6-tag in this sense, we retain the event, otherwise we remove 
it from the event sample. 

We next look at the three highest Pt jets among the subjets of i and j. The constituents 

of these constitute the filtered jet. The final step of the filtering analysis is to measure the 

( f) 
mass Mj^^ of the filtered jet. 

Having explained filtering, we are now ready to compare filtering, trimming, and prun- 
ing and combinations of them. Here, we will be somewhat tethered to filtering even if we 

( f) 
do not use the filtered jet mass M^^^. That is because it has been shown [19] that the 

filtering method described above is an efficient way for selecting events with double 6-tags. 

Therefore, we use the filtering method to reject events that do not have double 6-tags. In 

( f) 
the 6-tagged sample, we can measure any of the filtered jet mass, Mj^^ , the trimmed jet 

mass, MjJ^, or the pruned jet mass, Mj^l_. We apply the trimming or pruning procedures 
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independently of any subjet information from the mass drop plus filtering procedure. We 
know that the event was selected because there was a suitable mass drop and the appropri- 
ate subjets were 6-tagged, but we do not ask that the subjets used in trimming or pruning 
have any particular relation to the subjets that had been found in the mass drop plus 
filtering procedure. For trimming, we follow the method outlined in section 2, defining the 
fat jet using the anti-ZcT algorithm, but with R = 1.2 instead of the value R = 1.5 used 
in section 2. For pruning, we follow the method outlined in section 2, defining the fat jet 
using the C-A algorithm, again with R = 1.2. For filtering, we follow the method outlined 
in this section. 

We note that in a given event, the three mass measures, M]^^ , MjJ^, and Mj^l are not 
the same. We illustrate this in figure 6, in which we plot the numbers of events in bins of 
{MfJ^,M^/J) (top row), {M^^l,M^/J) (middle row), and (MJ^J,m]*J) (bottom row). These 
distributions are shown for the ZH signal in the left hand column and for the background 
in the right hand column. In each graph, we see that knowing one of the two masses does 
not tell us the other. There is, of course, a correlation, but it is far from perfect. For that 
reason, there is information to be gained by measuring two of these masses for each event. 
(We do not have enough generated events to divide them into a three dimensional grid of 
masses.) 

With this in mind, we choose a mass window for each of Mj^^, M^J^_, and M^^^, namely 
Wf = (110 GeV, 125 GeV), Wt = (105 GeV, 120 GeV), and Wp = (110 GeV, 125 GeV). 
Then, to start, we assume an integrated luminosity fdL = 30 fb~ is available and we 

count the number of signal events s and background events b expected with the mass of 

( f) 
the filtered jet in its window, Mj^^ € Wf. The results are displayed in the first column of 

table 3. Using just this information, the expected logarithm of the likelihood ratio favoring 

the presence of the ZH signal along with the background, eq. (2.8), is {C{n))sB ~ 1-7. Then 

we ask that both Mj^j, and MJ^j. be in their respective mass windows. This cuts the number 

of signal and background events, but makes (£(n))sB larger, indicating a greater statistical 

significance for the measurement. Similarly, we find larger values of (i2(n))sB both for the 

combination of Mj^^ and Mj^^ and for the combination of M^^^ and MjJ^. We point out 

here that we are looking only at statistical significance from counting statistics, not at 

other sources of error. Additionally, we note that (£(n))sB ~ 2 is not nearly enough to 

claim a discovery of the signal; however, if one had (£(n))sB ^ 4 from another independent 

method, such as a search for WH production, then the ability to add 2 to this would be 

not insignificant. 

It is rather limiting to base the assessment of whether data favors the presence of the 
ZH signal in addition to the background on simply the counts in a single window in a pair 
of jet masses. As we noted in our example of tt production in section 2, the experiment 
will give counts nj in each bin J shown in figure 6. Again, we can base our assessment 
on the log likelihood ratio using all of the information.^ Then the likelihood ratio is the 



®We include all of the bins that contain a background cross section of at least 0.001 fb. The results are 
not sensitive to this cut, which we impose so that we can have a reliable calculation of sj/bj. 



12 



160| 



> 
O 



150 



140 



'130 



E ^^° 



■C 100 




160| 



120 



le-01 > 


'S 150 


o 


140 


16-02%-^ 


^^130 

0,120 

c 


•T3 


^ 110 

E 



100 110 120 130 140 150 160 



Filtering - MJf^' [GeV] 



le-03 



■C 100 



3^0 




100 110 120 130 140 150 160 

Filtering - M^^'^l [GeV] 



le-03 



160| 



160. 




le-01 > 

a; 
O 



: le-02 






100 110 120 130 140 150 160 

Filtering - mIj{1 



[GeV] 



'^0 100 110 120 130 140 150 160 

(/) 
Jet 



'le-03 



Filtering - m| 



fGeV] 



160| 







„ 160| 
le-01 > > 150 



: lG-02^ 



i 



^'^0 100 110 120 130 140 150 160 ^^'^^ 



Trimming - M- 



(t) 

Jot 



fGeV] 




Trimming - M- 



rif) 



Figure 6: Joint distributions between pairs of the filtered jet mass Af j^^ , the trimmed jet mass 
M]gj., and the pruned jet mass Mj^l for the ZH signal (left column) and the background (right 
column). The events were generated with Mniggs = 115 GeV. 



product of the likelihood ratios for all of the bins used. Its logarithm is given by 

sj{m)' 



£({n},m) = Y^ 



nj log 1 



sj{m) 



(3.6) 



Here we have included in the notation the fact that the expected number of signal events 
sj in a certain bin depends on the assumed Higgs boson mass, m. Given data {n}, one 
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M^il e Wf 


M^il e Wf 

Mfl G Wt 


m\^2 e Wf 
M^ll e Wp 


^j5 e Wp 


Signal cross section [fb] 

Backgrnd cross section [fb] 

s/h 

s/Vb {JdL = 30 fb"^) 
(£(n))sB {fdL = 30 fb-^) 


0.20 
0.30 
0.67 
2.0 
1.7 


0.18 
0.20 
0.90 
2.2 
1.9 


0.17 

0.17 

1.0 

2.3 

2.0 


0.17 

0.16 

1.1 

2.3 

2.1 



Table 3: Statistical significance of ZH results for an integrated luminosity of 30 fb^ . Here we 
simply count the expected number of signal events, s, and background events, b, in certain windows 
for the mass of the filtered jet, Af jg^. , the mass of the trimmed jet, Mj^j., and the mass of the pruned 
jet, M^ll The mass windows chosen are Wf = (110 GeV, 125 GeV), Wt = (105 GeV, 120 GeV), and 
Wp = (110 GeV, 125 GeV). The Higgs mass assumed when generating events is A^Higgs = 115 GeV. 
In the first column, we ask only that the filtered jet mass be in the window Wf. In the remaining 
columns, we combine methods by asking that two masses be in the corresponding windows. For 
each type of measurement, we show three measures of statistical significance, s/b, s/Vb, and the 
logarithm of the likelihood ratio based on s and 6, eq. (2.8). 



can test not only whether the presence of the ZH signal is favored, but how the likelihood 
favoring the presence of the signal depends on the assumed mass m. The expectation value 
of C{{n},m) if the true Higgs boson mass is Mniggs and the signal is present along with 
the background is given by 



{Ci{n},m)) 



SB 



Y, isj{Mmggs) + bj)log(l + 



sjim) 
bj 



sj{m) 



(3.7) 



We have computed {C{{n},'m))sB for nine assumed values of m and for the three combi- 
nations of using two out of three of the filtered jet mass, the trimmed jet mass, and the 
pruned jet mass. The results are displayed in figure 7. 

We learn three things from figure 7. First, if we look at the case m = Mniggs, we 

have a stronger signal using the distribution in two out of the three variables Mj^l , MjJ^ , 

(p) 
and Mjg^ together than we have for just one variable. Second, with the distribution in two 

variables, we have better resolution in which trial mass m best fits the data compared to the 

resolution obtained with just one variable. Finally, using M^J^ together with Mj^l, we have 

{C{{n},m))sB ~ 2.7. This is better than the corresponding result, {C{{n},m))sB ~ 2.1, 

from table 3, in which we used simply the number of counts in a fixed window in M\1 and 



M 



(p) 

Jef 



We have tried one more small adjustment. In the pruning method, Refs. [3, 4] recom- 
mend that the parameter Zcut, Eq. (2.5), be set to 0.1. That is the value we have used. 
However, we find that the value 0.05 does a better job in this application, as shown in 
table 4. Changing to Zcut = 0.05 allows the pruned jet to absorb more soft radiation. This 
enhances the asymmetry in the jet mass between pruning and trimming. Although the cor- 
relation of the jet mass for the signal process is weakened it mainly affects the background 
of light parton jets, see figure 8. 
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Figure 7: The log likelihood ratio in the ZH production process as a function of the trial Higgs 
boson mass m, assuming an integrated luminosity of 30 fb~ . We construct >C({n}, m) for nine 
different values of m. Then we take the expectation value of these quantities in the signal + 
background theory with the true Higgs mass, Mniggs- The results are shown using the the filtered 
jet mass Mj^^ alone, the trimmed jet mass Mj^J. alone, the pruned jet mass M^^l alone, and for 
M^j^l combined with m{^j\ for M^/J combined with m|*| and for M^^^l combined with m|*|. 



One can well be concerned that smearing of jet masses because of detector effects might 
affect the results presented here. To check, we applied Gaussian smearing on Mj^^, ^jJt 
and Mjg^ according to [20] but could not find sizable differences in the log likelihood ratio. 
More realistic finite jet resolution effects might change the quantitative statements in this 
paper, but a reliable simulation of them is beyond the scope of our work. Thus all results 
shown are without detector smearing effects. 



4. Conclusion 

In searches for a narrow boosted resonance in which the signal is small compared to a 
background coming from QCD induced light parton jets, the combination of pruning, 
trimming and filtering can help to extract the signal from the background. 
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Signal cross section [fb] 

Backgrnd cross section [fb] 

s/h 

s/Vh {JdL = 30 fb"^) 
(£(n))sB ifdL = 30 fb-^) 



r(p) 
Met 

0.16 

0.13 

1.3 

2.4 

2.2 



Table 4: Statistical significance of ZH results for an integrated luminosity of 30 fb^ as in table 
3 except that here we take Zcut in the pruning method to be 0.05 instead of 0.1. This improves the 
statistical significance compared to the (Mj^^, AfjJ) results in the rightmost column of table 3. 
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Figure 8: Joint distributions between pairs of the trimmed jet mass Afj^^ and the pruned jet 
mass Mj^l for the ZH signal (left column) and the background (right column). The events were 



generated with Mniggs = 115 GeV and Zc 



0.05. 



For a generic resonance tagger, one can look for excess events in a combined window 
of two of the pruned, trimmed, and filtered jet masses. Even if a new physics model for 
the signal is not anticipated, the approach outlined in this paper can be used to improve 
on the statistical significance of a so-called "side bin analysis." Both s/b and s/\/b can be 
improved for a given window around the resonance mass when different jet mass measures 
are used together. 

If one has good models for the expected background and the sought signal, one can 
gain further statistical significance by using a likelihood analysis based on the models for 
signal and background. 
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A. Analysis of data with relative likelihoods 

In this paper, we have made use of analysis of data using relative likelihoods. This method, 
in one form or another, is quite widely used. A description convenient for our use can be 
found in Ref. [21]. We provide a brief summary in this appendix. 

We suppose that we measure one or several variables v = {vi,V2, ■ ■ ■ , vl) for each event. 
We put the events into bins, with labels J, based on the value of v of each event. Let nj 
be the number of events in bin J. Then the result of the experiment is a list of the values 
{n} = {ni, • • • , TiAr} of the numbers of events in each bin. 

We suppose that we have a model (say, based on Pythia) for the expected number 
of events in each bin if there is no new physics signal. This is the background model, 
designated B. Let us denote the expectation value of nj in the background model by hj. 
We also suppose that we have a model for the expected number of events in each bin if 
there is a certain new physics signal. This is the model then includes both the background 
and the sought signal. We call this model SB. Let us denote the expectation value of nj 
in the signal plus background model as hj + sj. For the moment, we assume that there is 
no uncertainty in what models B and SB predict. 

Given model B, the probability to find result {n} is 



^B(M) = n;rT(^^)"'""'' • (^-^^ 



J ^^= 



Given model SB, the probability to find result {n} is 

PsB[{n}) =X{—,{h.j + sj^e-'^-'^ . (A.2) 



The ratio of these is the relative likelihood to find the observed result, 

R{{n)) = ff^ . (A.3) 

This ratio tells one how to modify a prior opinion about the probability that SB as op- 
posed to B holds in nature. Thus it is a convenient statistic to describe the results of the 
experiment. 

We can write the likelihood ratio as 

R{{n]) = exp£({n}) . (A.4) 

The logarithm of the likelihood ratio, £({n}), has a simple expression 

C{{n}) = Y,[nj\og{l + sj/hj)-sj] . (A.5) 



17 



The theory for signal and background can depend on parameters, so that sj and bj 
depend on the parameters. Then C{{n}) depends on the parameters. Given data {n}, we 
can adjust the parameters to find the version of the theory with the biggest C{{n}). In this 
paper, we consider the simple case in which there is a single parameter^ that we consider 
varying, a mass that we denote by m. The signal depends on m; the background does not. 
Thus 



£({n}, m) 



E 

J 



njlog{l + sj{m)/bj) - sj{m)] . 



(A.6) 



To understand this better, it is useful to consider the case in which sj <C bj and 
{nj — bj) <^bj in all bins. Then 

'sj{m) sj{mf' 



C=Y.\[bj + {nj- bj)\ 



bj 



265 



+ 



2^ < sj{m) + [nj - bj) —, h 



J 

E 

J 



bj 



{nj - bj)sj{m) sj{mf 



2bj 



sj{m) 
sj{m) 



(A.7) 



bj 



2bj 



This has a simple interpretation. We see that C is large when the observed signal {nj — bj) 
is correlated with the expected signal sj{m). That is, C is large when (rij — bj) > in 
those bins for which sj{m) > 0. There is a penalty contribution, sj{mf' /{2bj) for each 
bin. Thus, to keep C positive, {nj — bj) needs to be bigger than sj{m)/2 in the bins with 
expected signal. 

Suppose that the SB theory is correct if we set the mass to its true value Mtrue- The 
expected value of C{{n},m) in this case is 



(^) 



J 



[nj log(l + sj/bj) - sj] 



where 



nj = bj + sj{Mtruc 
In the case of small signal/background, we have 

sj{Mtruc)sj{m) sj{mf' 
J 
When we set m to Mtrue j this is 



(Q-E{^ 



bj 



2b 



J 



(A.^ 



(A.9) 



(A.IO) 



(£({n},Mtruc))~ J^ 



sj{M, 



true J 



2bj 



(A.ll) 



That is, what counts in this limit is sj j \/bj. 

^If sj /bj <IC 1 in the bins with the most signal, then it is important to know the normahzation of the 
background quite precisely. In this case, one might introduce a parameter A that represents the normaliza- 
tion of the background and use the data to fix A. 
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It is a simple matter to evaluate how the observed value of C{{n},m) fluctuates as- 
suming that the SB theory is correct when m = Mtrue- With Poisson statistics, we have 



(nj) = nj , 
Let us adopt the shorthand notation 



(A.12) 



L, = log(l + ^~) . (A.13) 



5J 

Then 

/: = Y,[njLj-sj] . (A.14) 

J 

Let us denote the expectation value of £ by £, 

Z = ^[njLj-sj] . (A.15) 

J 

Then the variance of C is 

= '^LjLK{{nj -nj){nK -uk)) (A.16) 

J,K 
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